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Abstract 



We give an 0(log log /c)-competitive randomized online algorithm for reordering 
buffer management, where k is the buffer size. Our bound matches the lower bound 
of Adamaszek et al. (STOC 2011). Our algorithm has two stages which are executed 
online in parallel. The first stage computes deterministically a feasible fractional solu- 
tion to an LP relaxation for reordering buffer management. The second stage "rounds" 
using randomness the fractional solution. The first stage is based on the online primal- 
dual schema, combined with a dual fitting argument. As multiplicative weights steps 
and dual fitting steps are interleaved and in some sense conflicting, combining them is 
challenging. We also note that we apply the primal-dual schema to a relaxation with 
mixed packing and covering constraints. We pay the 0(log log k) competitive factor for 
the gap between the computed LP solution and the optimal LP solution. The second 
stage gives an online algorithm that converts the LP solution to an integral solution, 



while increasing the cost by an 0(1) factor. This stage generalizes recent results that 
gave a similar approximation factor for rounding the LP solution, albeit using an offline 



rounding algorithm. 



^ 1 Introduction 



In the reordering buffer management problem (RBM) an input sequence of colored items 
arrives online, and has to be rescheduled in a permuted output sequence of the same items, 
with the help of a buffer that can hold k items. The items enter the buffer in their order of 
arrival. When the buffer is full, one color present in the buffer must be chosen, and the items 
of this color in the buffer, followed by any new items of the same color encountered along 
the way, are scheduled in the output sequence one item per time slot, making room for new 
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input items to enter the buffer. The choice of color is made before future input items are 
revealed. Choosing a color and evicting items is repeated until we reach the end of the input 
sequence and we empty the buffer. The objective is to minimize the total number of color 
changes between consecutive items in the output schedule. This seemingly simple model, 
introduced in [18], formalizes a wide scope of resource management problems in production 
engineering, logistics, computer systems, network optimization, and information retrieval 
(see, e.g., P21 ED EH E5])- Moreover, beyond its simplicity, elegance, and applicability, the 
problem turns out to be challenging, and it captures some new and fundamental issues in 
online computing. We note that the offline version of RBM is NP-hard [61 [13] , and there is 
a polynomial time 0(l)-approximation algorithm [8]. 

This paper resolves the randomized competitive ratio of RBM. We design a randomized 
online RBM algorithm and prove that its competitive ratio is O (log log k). This matches the 
recent lower bound of Q(\og\ogk) of Adamaszek et al. [2]. All previous online algorithms 
for RBM are deterministic. A sequence of papers [IHl HSl d, [2] culminated in an 0(y/\og k)- 
competitive algorithm (2] , nearly matching the deterministic lower bound in the same paper. 
Thus, our work is the first to demonstrate an exponential gap between the deterministic and 
the randomized competitive ratio of RBM. 

In essence, our algorithm is an implementation of the primal-dual schema, and more 
specifically of the multiplicate weights update method (see [12] for a survey of its use in 
online computing, and [5 J for a general survey of the method). We compute online a feasible 
solution to an LP relaxation for RBM. This part is done deterministically, and it uses the 
same relaxation as in our past paper [7]. As we compute the LP solution, we feed it to an 
online rounding algorithm, which generates an integral solution on-the-fly. This part uses 
randomness, and is motivated by our recent paper [8] that gives a (deterministic) constant 
factor polynomial time approximation algorithm for RBM. 

One of the interesting aspects of our result is that we apply the multiplicative weights 
update method to a bipartite perfect matching- style linear program. Essentially all prior 
online results using this method (e.g. [U EDI El E] ) were derived through relaxations that are 
packing or covering linear programs, or small variations thereof (such as having additional 
box constraints). Another interesting aspect of our result is that, for reasons explained 
below, we cannot apply this method in its pure form. We must combine it with a dual fitting 
argument that is similar in spirit to that in [7]. Combining the two conflicting approaches 
into a hybrid primal-dual algorithm and proof is the main technical challenge of our work. 

What's unique about RBM and what separates it from other reputed online problems is 
the following. Usually, when an online algorithm makes an irrevocable decision, this may 
change the state of the system in a way that changes the cost of future decisions, but it only 
sets the current output. In contrast, when an RBM algorithm makes a decision to evict a 
color block from the buffer, this may decide the output for many steps ahead, as only one 
item can be evicted in one step. What's even worse, as items that are already in the buffer 
when the decision is made are evicted, new items of this color that arrive along the way 
can be appended to the evicted sequence, so the algorithm is perhaps deciding now how to 
handle future input it hasn't yet seen. Moreover, the size of the color block being evicted 
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could have a dramatic influence on the length of the sequence being evicted, and hence on 
future cost: if two algorithms differ by just one item between their buffer contents at some 
point in time, and they both decide to evict a color in which they differ, the algorithm with 
one less item might evict a much shorter sequence than the algorithm with one more item. 

Very recently, Adamaszek et al. [3] proposed a problem they call buffer scheduling for 
block devices, which is a variant of RBM that differs from it in one crucial aspect. In block 
devices, while evicting a color from the buffer, new arriving items of that color cannot be 
appended to the output sequence without incurring additional cost. For example, if the 
entire input sequence consists of a single color, an RBM solution pays 1, while a block 
device solution pays approximately Thus, their problem eliminates the issue of making 
decisions regarding unknown future items, but it still has to cope with the problem of making 
decisions regarding future output steps. They give an (log log /c)-competitive randomized 
online algorithm for buffer scheduling for block devices that implements the multiplicative 
weights update method (see p2]), using a covering LP formulation. 

Their result, which motivated our work and influenced part of it, overcomes the issue of 
deciding on future output steps by employing a resource augmentation argument, adapted 
from [H]. In [15} E] it is proved (for RBM, but the same proof applies to block devices) 
that if we replace a buffer of size k by a buffer of size |, then the optimal value of a solution 
cannot increase by more than an O(logfc) factor. If it were possible to reduce this factor 
to 0(1) for some constant factor decrease in the buffer size, we could derive our results 
with ease using a fairly simple implementation of multiplicative weights. Unfortunately, the 
0(\ogk) factor is tight [lj. What is used in [3 J is a straightforward generalization of the 
proof in [T3J, which shows (for block devices, but it is easy to generalize the proof also for 

RBM) that replacing a size k buffer by a size k' buffer, for k' = (l - ■ k, increases the 

optimal cost by a constant factor. 

Intuitively, resource augmentation equips the online algorithm with some lookahead of 
k — k! items, and this lookahead allows the algorithm to decide on up to k — k! steps into the 
future. Technically, this helps implement the online primal-dual schema in the two crucial 
parts of the argument. Firstly, the gap enables an initialization of the primal variables 
(triggered by the corresponding dual constraint becoming tight) to roughly instead of 
t, and this reduces the competitive ratio that one can hope for from log k to log(A;/(A; — k')). 
Setting the multiplicative weights update method to prove this claim is the main contribution 
of [3] to extending the method to handle RBM-style problems. Secondly, lookahead helps 
bound the rate of growth of the primal cost. In order to relate it to the rate of growth of the 
dual cost, we'd like to remove items (fractionally) from the buffer at a rate that is roughly 
the fractional volume that we've already removed from items that are still present with some 
weight in the buffer. This is easy if we only take into account the removed volume that 
already disappeared from the buffer. However, past decisions and current decisions extend 
into the future, and there is some volume that has been scheduled to be removed, but its 
removal hasn't yet happened. In particular, some items can no longer "participate in the 
game" despite being still in the buffer, because they've already been scheduled to be removed 
entirely from the buffer (this reduces the growth rate of the dual cost). At least in the case 
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of block devices, if there are never more than 0{k — k') items of a single color in the buffer, 
then this scheduled but not removed volume never exceeds 0(k — k'), and this bounds the 
growth of the primal cost adequately. However, even in the case of block devices, the buffer 
may contain very large color blocks. In [3] they overcome this problem by generating an 
infeasible primal solution. Thanks to the fact that in the block devices setting the future 
schedule does not include any future items, they are still able to round the infeasible solution 
to get a feasible integral solution. 

This approach does not work in the case of RBM. It can be made to work if the buffer 
never contains a color block larger than 0(k — k'), as in this case a feasible primal solution 
can be generated. On the other hand, if every color block in the buffer always has (at decision 
points) at least k — k! items, then (a simple version of) the deterministic dual fitting algorithm 
of [7] (and probably also earlier algorithms) guarantees a competitive ratio of 0(\og(k/(k — 
k'))). Thus, the main challenge and the main technical contribution of this paper, is to 
combine the two methods to give an 0(log(k/ (k — k'))) competitive ratio without restrictions 
on the instance. As at any given time the buffer can be in a "mixed" state with both small and 
large color blocks, combining these methods is non-trivial. A covering formulation similar to 
the one used in [5] cannot be used, to the best of our knowledge, so we are required to deal 
with a non-covering formulation that is harder to incorporate into the primal-dual schema. 
Adding to the challenge is the fact that the dual fitting argument inherently generates an 
integral solution, whereas the primal-dual schema inherently generates a fractional solution. 
Thus, we have to decide if a color block will turn out to be small or large before we know how 
many items of this color we can accumulate in the buffer. If we start removing it fractionally 
using the primal-dual schema, we cannot regret this decision later and switch to dual fitting. 
Rounding also poses its own challenge, mainly due to the above-mentioned feature of RBM, 
whereby premature eviction of a color may cost us a great deal later. (This is something 
that does not happen in the block devices problem.) 

The rest of the paper is organized as follows. After introducing some notation, definitions, 
and an overview in Section 121 we present our online primal-dual algorithm in Section [3] and 
analyze it in Section HJ Finally, we present the online rounding algorithm and analyze it in 
Section |5j The explicit constants in the rest of the paper are somewhat arbitrary. We made 
no attempt to optimize them. 



2 Preliminaries 

Let X be a sequence of colored items. We denote the color of an item i by c{i). Abusing 
notation, we denote the color of a sequence I of items of the same color by c(J). We denote 
by OPTfc(X) the cost of an optimal (offline) RBM schedule of X using a buffer of size k. 
The following lemma is adapted from [UJ [3]. For completeness, we include a proof in the 
appendix. 

Lemma 2.1. For every input sequence X and for every k' < k, OPTv(X) < 2fc+< - fc ~f ^ lnfc • 
OPT fc (X). 
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In our algorithm and analysis we use this lemma with k' = k — which increases the 
optimal cost by a constant factor. 

Consider a sequence / of items of a single color c in I that includes all the items of this 
color between the first and last item of /. If there is an RBM solution that outputs / starting 
at time j, we call the pair a batch. Thus, an RBM solution consists of scheduling or 

packing batches in the interval of output time slots {k + 1, k + 2, . . . , k + n}, where every 
output slot is used by at most one batch, and every input item is scheduled or covered by at 
least one batch. In other words, an RBM solution is a bipartite matching of input items to 
output slots. The matching must observe the order of input on each color separately, and an 
item cannot be matched to an output slot that precedes its arrival. The cost is the number 
of batches, where a batch is a maximal output interval that got matched to a set of items 
of the same color. This discussion leads us to a natural linear programming relaxation for 
RBM. We can think of the output slots as a channel of width 1 spanning the output interval 
{k + 1, k + 2, . . . , k + n}. We pack batches fractionally in this channel, without violating the 
width constraint, but covering all input items at least once. This relaxation is essentially 
identical to the one used in [7J [8]. We denote it simply by Formally, LP& is 

minimize xij subject to 

{1,3) 

X W - 1 

(i,j')-j'<i<i'+\i\ 

x > 0. 

Here, xjj is the weight of the batch in the packing. Constraints (pQ) require that every 
item is eventually removed from the buffer (in batches of total weight 1). Constraints (121) 
restrict the output to remove a total weight of at most 1 in each time slot. The dual linear 
program, which we denote by DP& is 

n k+n 

maximize yi — Zj subject to 

i=l j=k+l 

i+m-i 

Z>- E z f ^ 1 v ( 7 ^') ( 3 ) 

i£l j'=j 

y,z>0. 

Our algorithm computes online an LP^ feasible solution x and a DP^/ feasible solution 
(y,z). The algorithm feeds x, as it is being produced, to an online "rounding" procedure 
that produces an LP^ feasible integer solution x, which is the output of our online algorithm. 
Our main result, which the rest of the paper builds towards, is 

Theorem 2.2. There is an O (log log k)- competitive randomized online algorithm for RBM. 



Vz = 1,2, ...,n 



Vj = k + l,...,k + n 



(1) 
(2) 
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Proof. Theorem 13.11 establishes that the value of x is at most 0(loglogA;) times the value 
of (y,z), which is a lower bound on OPTV, and hence at most O(oPTfc) (by Lemma l2.ip . 
Lemma 15.11 establishes that the value of x is at most 0(1) times the value of x, and this 
concludes the proof of the theorem. ■ 



3 The Online LP Solution 

In this section we give an online algorithm that constructs a primal feasible solution x to 
LPfc and a dual feasible solution (y,z) to DP&/. In Section [4] we prove the following theorem. 

Theorem 3.1. 

(n k'+n \ 

2 y* - Y Z A ■ 
i=i j= k'+i J 

We construct simultaneously a feasible primal solution x, an infeasible dual solution (y, z), 
and an auxiliary dual penalty y. The construction of (y, z) uses a non-trivial implementation 
of the multiplicative weights update method, and y is generated by a dual fitting argument. A 
feasible dual solution (y, z) can be derived by scaling down (y+y, z) by a factor of 0(log log k). 
The algorithm maintains throughout its execution for every color c an index s c which is the 
earliest item of color c whose primal constraint ([T]) is violated, i.e., j)- s c ei x iJ < 

Notice that if a color c is not present in the buffer (for instance, c has not been encountered 
yet), the algorithm may not know s c . However, if the buffer has no item of color c, then the 
algorithm does not use s c , so this does not cause a problem. The algorithm further maintains 
the earliest output slot t whose primal constraint (j2J) is not tight, i.e., Yla jy j<t<j+\i\ x i,j < 1- 
Initially, t is set to k + 1 . 

The dual solution (y, z) is generated as follows. Initially, all dual variables are set to 0. 
The solution is parametrized by /i, which is raised at a uniform rate. We occasionally refer 
to jj, as time, but this should not be confused with the discrete input and output time steps. 
Further notice that even though for convenience we describe the algorithm as a continuous 
process, it can be discretized easily, and it can be implemented efficiently (regardless, com- 
petitive analysis is not concerned with computational efficiency). The algorithm raises all 
the variables yi with i > s c u) and all the variables 2j for j > t at the same rate dfi. (This 
raises also future y^s and Zj-s; when we reach them, we will initialize their value to what's 
determined by this process.) Notice that we raise the y,-s corresponding to violated primal 
constraints ([T]), and the Zj-s corresponding to primal constraints ([2D that are not tight. Rais- 
ing dual variables causes x to change, thus removing items fractionally or integrally from 
the algorithm's buffer. This eventually increments the s c -s and t, thus changing the set of 
dual variables that are raised. It also affects y. The process ends when t passes past time 
k! + n. At this point, we simply evict the remaining buffer contents, using the output slots 
up to time k + n. (Notice that this last step does not cost more than the total number of 
colors plus one; the total number of colors is a lower bound on the optimal cost.) 

We now explain how raising (y, z) affects x. At any given time, let B denote the set of 
items encountered so far, whose primal constraints are violated, and let B c denote the set 
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of items of color c in B. In other words, B c includes all the color c items that appear in the 
input sequence starting from s c and before the current slot t. Notice that for every item in 
B, at least a fraction of that item is still in the algorithm's buffer. There may be additional 
items in the algorithm's buffer. These are items that are already scheduled to be removed 
entirely from the buffer, but t hasn't yet passed the point where they disappear from the 
buffer. The items in B are endowed with one of three states: fractional, integral, or frozen. 
If any item in B c is integral, then they all are. Otherwise, the first ones are fractional and 
the remaining ones (if any) are frozen. We will refer to a set of items in B of the same color 
and the same state as a block. Thus, B c consists of either one or two blocks: an active block 
B® ct that is either fractional or integral, and a frozen block B{ rz that might be empty (and 
must be empty if B a c ct is integral). With a slight abuse of terminology, we sometimes also 
refer to all of B c as a block. These sets (B, B c , B a c ci , B[ rz ) are all functions of \x (and so are 
s c , t, x, y, z, y, and other variables defined below). 

Consider a dual constraint indexed and put c = c(J). Let 



denote the current dual cost of the batch Notice that we know this value at any time 

/x, even if the batch is matched to output slots we haven't yet reached. 

Fact 3.2. Consider a batch of color c. If^f- > then there must be an itemi G B C (~)I 

that is matched by to an output slot before the current time t. 

Proof. If all the items in B c n I are matched by at time t or later, then for all / G 

[j,j + |/| — 1], we have that f > t, so zy increases, and therefore ajj cannot increase. ■ 

The algorithm produces a primal solution x by scheduling batches of items, i.e., by raising 
xjj, for some batches (J,t), where t = t(fi). If we schedule a batch (J,t) of color c, then J 
begins with the items in B® ct (at the time /i when (J,t) is scheduled) and J could extend 
beyond B^ ct . We append a new item % of color c to J if and when the following becomes 
true: i's state is the same as the state of the previous items in J (when they were added to 
J), and we did not pass beyond the end of the current schedule of J. In particular, when an 
item extends J it is in B® ct . Specifically, we do not append i to J, even though it is in our 
buffer by the time we reach the end of the current schedule of J, if i G B[ rz at that time. To 
summarize, scheduling a batch of color c at time fi involves packing in the output stream, 
starting with output slot t = t(fj,), the sequence of items in B® ct = -B" c< (/^) (with a weight 
that cannot be greater than the remaining unscheduled weight of the first item in B® ct ), and 
later possibly extending this sequence with new items on-the-fly. 

The regular execution of the algorithm is to schedule continuously batches for every color 
c for which B® ct is fractional. The rate dxj tt at which we raise xj )t is governed by pseudo- 
dual cost variables &ij and pseudo-primal variables xij, defined for all batches We 
maintain the equation 
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We set 

^=max(^: c(7) = c(J)\ . 
djx {d/x J 

Notice we schedule batches simultaneously for all colors with fractional items in the buffer. 

In order to complete the description of the algorithm's regular execution, we need to 
explain how 07 j changes. Initially, 07 j is set to 0, and at certain events (see below) we reset 
&ij to 0. During an interval [/Xi, LI2] with no reset, &jj does not decrease. In order to explain 
the increase in ajj, consider the increase in 07 j when jx changes by an infinitesimal amount 
dii. Let t = t(fx), and let c = c(I). If t > j or if all the items in B c n I are matched by 
(I, j) at time t or later, then 07^ does not increase, and 07 j does not change. Otherwise, 
aij increases by d/x times the number of items in B c n I that are matched by before 
time t. In this case, 07 j increases by d/x times the number of items in B^ ct D I that are 
matched by (J, j) before time t. We will later see that in this case dajj > jt ■ dajj. We say 
that a batch (J,t) of color c that is scheduled during this increase is relevant to (the dual 
cost of the batch) Intuitively, if (J, t) is relevant to then when xj jt increases, 

xij increases by at most the same amount. Notice that if a batch (J,t) that is relevant to 
is scheduled without interruption (i.e., it never reaches an item that is frozen at the 
time slot it needs to be scheduled), then J includes the last item of /. 

Occasionally during regular execusion, we reset 07^ to 0. We call this a regular reset (to 
distinguish it from other resets that happen when regular execusion is interrupted). This 
happens in the following situation. Let the current time be \i. Let / = f(I,j) G 7 be the 
first item that interrupts (is not appended to) a scheduled batch (J,f) (f < t(/x)) that is 
relevant to because / G B[ rz when it needs to be appended. If at time \x the number 

of items in B c that arrived before / just dropped below ||-B C |, we reset 07 j to 0. Notice that 
we do this only for the first such item / G I, so for any batch (J, j), we do a regular reset at 
most once. We denote the time of the regular reset by Lia(I,j). If (/, j) never experiences a 
regular reset, we put /xo(I,j) = 00. Also notice that if &ij is reset to 0, automatically xij is 
reset to 0. 

Regular execution is interrupted in a few cases as follows. Upon interruption, we keep 
executing the valid cases until none of them hold, in which case regular execution is resumed. 

Case 1: A primal constraint (JTJ becomes satisfied, i.e., for some i G B, jy iei x i,j 
reaches 1. In this case we increment s c uy Notice that this also changes B^ ct . 

Case 2: A primal constraint fl2]) becomes tight, i.e., J2i x ij '■ J — * < 3 + 1-^1} reaches 
1. In this case, we increment t. Each new item that enters the buffer initializes its state 
as follows. If there are integral items of the same color, it enters the buffer as integral. 
Otherwise, it enters the buffer as frozen (however, the frozen state may change immediately 
due to the application of one of the following cases). 

Case 3: If \Bl rz \ > -. nr , fc . , for some color c, we schedule all the remaining volume of B? ct . 

1 c 1 100 m k ' o c 

(This may involve scheduling several distinct batches, and it also increments s c .) Then, we 
change the state of all the items in B[ rz to integral. (In particular this moves all of them to 
B® ct .) Finally, we reset &ij (and hence xij) to for all batches of color c. 

Case 4 : If Bc Ct is fractional and \B^ ct \ < w \ nk , we change the state of all the items in 
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B£ rz to fractional (in particular, they move to B^ ct ). 

Case 5: There is an integral block B® ct , and 07 j reaches 1 for a color c batch 
(Notice that taking into account the reset in Case 3 above, we can assume that J is a 
subset of B^ ct .) We suspend all fractional scheduled batches that haven't yet ended. We 
set yi = 2 | g 1 aet for every i 6 B^ ct . We schedule, starting at the current t, an integral ( 
weight- 1) batch with all the items in B^ ct followed by any items that can be appended to 
that block while it is being evicted from the buffer. We reschedule the unfinished portion 
of the suspended batches following this integral batch. Finally, we reset &ij (and hence 
xjj) to for all batches of color c. Notice that following this case, both s c and t are 

incremented by at least |5" c *|. 

Case 6: Since the last application of this case, we've moved past the end of regular 
execution fractionally scheduled batches of color c with total weight at least ^ (suspended 
batches are not considered to have ended). We apply the same procedure as in Case 5 to the 
block Bf rz , except that we don't raise y. To distinguish batches scheduled by this case from 
integral batches scheduled by Case 5, we will refer to the ones we schedule here as weight-1 
fractional batches. 



4 Analyzing the LP Algorithm 

We first observe that by the definition of the algorithm (Case 1 and Case 2) it constructs a 
feasible fractional solution. 

Observation 4.1. The primal solution x is a feasible solution o/lp^. 

We now bound the total volume of items in the buffer that the algorithm schedules at 
any given time while in regular execution. 

Claim 4.2. If Bf is fractional, then \B C \ < and \B% ct \ < j^k- 

Proof. By Case 3, if B^ ct is fractional, then \B{ rz \ < 100 fc lnfc - By Case 4, we keep new items 
of color c in B[ rz , unless \B^ ct \ < 1Q ^ nfc . If B^ ct drops below 10 ^ nfc , we move the items in 
B? rz to B^ ct , adding at most 100 fc ln fc new items, so \B® ct \ < k always (while fractional). 
Combining this bound with the bound on B[ rz , we get that \B C \ < 10 q 2 ^ k . ■ 

Corollary 4.3. At any time fi, the volume of items in B(fi) that is scheduled beyond time 
t(/i) — 1 is less than 1( ^^ n k . 

Proof. There is a total weight of less than 1 of scheduled batches that extend to time t = t(/i) 
and beyond (otherwise, Case 2 would have happened). Each of these batches is fractional, 
so by Claim I4T21 it has less than k items that are in B(fx). Therefore, the total volume 
that is scheduled of items in B(jj) is less than 10 Q 2 ^ fc . ■ 
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Claim 4.4. Consider the (partial) solution x at a time of regular execution of the algorithm. 
The total volume that the algorithm already scheduled of items that are currently in B is 
bounded by 

J2( X U ■ \ BnI \) < \B\-k'. 

(U) 

Proof. Let \i denote the current time, and let t = t(/i) denote the current output time slot. 
By Corollary 14. 3[ the total volume of items in B that is scheduled beyond time t — 1 is less 
than 10 Q 2 1 p k ■ The total volume that is scheduled before time t is exactly t — 1 — k. By the 
definition of B, exactly t — 1 — \B\ items are scheduled to be removed completely from the 
buffer. Therefore, the volume that is scheduled from items in B is bounded as follows: 



1 2k 

\Bni\) <{t-i-k)-{t-i-\B[ 



(1,3) 



100 In A; 



\B\-k + r = B - Ik' + —} + — < LB - k . 

1 1 100 In A; 11 V Ink J 100 lnfc 1 1 



Claim 4.5. For every batch it holds that xij < always. 

Proof. Let c = c(I). Notice that xij can increase beyond 1 only while B^ ct is fractional. 
Consider an interval [/ii, /i 2 ] of uninterrupted regular execution where 07 j increases by S and 
the output time slot is t. Let (J, t) be a scheduled batch relevant to Notice that xj tt 

is at least the total increase in xij during the interval [^1,^2] where xjj was set. We prove 
the claim by bounding the total increase in xjj for all relevant (J,t). 

First notice that the total increase for all Xjj that extend all the way to the last item in 
I must be at most 1. This is because after an increase of 1 the last item of I is no longer 
in B, and 07 j cannot increase further. (A batch that is suspended and later rescheduled is 
considered here as a single batch.) The only reason that we do not extend J to the last item 
of I is if some item along the way is frozen at the time it needs to be appended to J. If 
when we reach the end of J's schedule, we've accumulated a cost of at least of interrupted 
schedules since the last time a weight- 1 fractional batch of color c was scheduled, then we 
schedule a weight- 1 fractional batch beginning with B[ rz . Notice that the items of B[ rz are 
scheduled past where they are matched by so all the remaining items of / will be 

evicted from the buffer completely, and ajj will not increase further. 

So consider the first interrupted such (J, t) (so the first color c item following J is f(I,j)). 
At the time \i when the interruption occurs, all the items in B£ rz ([i) are not appended to any 
scheduled batch. Let fi' denote the time when the items that were in B^ rz (fi) are schduled to 
be removed entirely from the buffer (i.e., they are removed from B). Every scheduled batch 
that removes these items must schedule them after time slot t(/j), so unless such a batch is 
interrupted, it includes the last item of /. 

Suppose that between time /1 and time // no other scheduled batch of color c is inter- 
rupted. If t(n') is past the end of the first scheduled batch that removes B^ rz (fi), then this 
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batch is not interrupted and thus it includes the last item of /. Therefore, none of the 
scheduled batches that remove B[ rz (fi) are interrupted before they include the last item of 
/. Otherwise, at time // there must be a total weight of 1 of scheduled batches of this color, 
because each item in B^ rz (^i) is scheduled with total weight 1 in batches that begin past 
t(fx), and none of these batches are interrupted until t(fjf) (by our above assumption). In this 
case, at time fi' all of B® ct (fi') is scheduled to be removed completely from the buffer. Also, 
it must be that B[ rz ([i') = 0, because if B{ rz was non-empty just before //, it is moved to 
gact ^ ue £ Q Q ase 4 Q f algorithm (as \B® ct \ drops to 0). Therefore, while all these batches 
are still being scheduled, any new item of color c is appended to all of them and is thus 
removed from the buffer, so none of them are interrupted at least until the first one ends. 
As the first such batch that ends must include the last item of /, they all must include the 
last item of /. 

If there is an interruption between time /x and time //, repeat this argument for the 
new interrupted batch (J,t) and interrupting B{ rz . Notice that any such interruption must 
have the property that (J,t) must schedule the previous B{ rz past t(/x), and therefore past 
where it is matched by If we accumulate J2 Jt xj,t > jq °f interrupted (J,t) at some 

point, then we schedule a weight- 1 fractional batch, and by the argument above ajj does 
not increase further. Notice that in this case the last such Xj it < 1 so xjj < Otherwise, 
the weight of interrupted (J, t) is less than and the weight of uninterrupted (J,t) is at 
most 1, so xij < yi. (Notice that along the way we might have reset xij, but this can only 
decrease its value, and we analyzed aggregate increase xij.) ■ 

4.1 Dual feasibility 

The main technical difficulty is to show that the dual solution that the algorithm computes 
is a feasible solution. In order to prove this, we need to show that the constraints ([3]) are 
satisfied, namely that for every batch 

3+|/|-l 
»6/ j'=3 

We consider several cases in the following claims. These cases will be combined in the 
pursuing proof of Lemma [4. 10L Consider a batch The items in / are partitioned by 

the algorithm's execution into segments. A segment is a maximal substring of items with the 
same state when removed from the buffer. Thus, there are alternating fractional and integral 
segments. An integral segment consists of a block of items that were removed together in 
a single application of Case 5 of the algorithm. In between two integral segments (or an 
integral segment and an endpoint of /, or two endpoints of J) there is a fractional segment. 
We first deal with batches that do not contain an integral segment. 

Claim 4.6. For every batch for which all of I is one fractional segment, 

_^ 3+|/|-i 

(Tlj=^2yi- % = 0(loglogfc). 

»e/ j'=j 
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Proof. Denote c = c(I). Notice that ojj increases only when t > j and s c G / (this is 
a necessary but not sufficient condition). We bound the total increase in ajj, ignoring 
possible decreases along the way. Therefore, we may assume that J is a maximal set without 
an integral segment, because extending it backwards and forwards can only make the sum 
of increases larger. To see this, notice that if t > j, then extending / backwards adds 
items whose y value possibly increases, whereas its corresponding z value remains fixed. 
Extending / forwards adds items whose y value definitely increases (because s c G I), and its 
corresponding z value possibly also increases. 

Notice that there is at most one value /x = po(I,j) of \x where ojj (and therefore xjj) 
is reset to while s c G /, because / does not contain an integral segment. Recall that 
/ = f(I,j) is the first item in / that is in B[ rz when we need to append it to a relevant 
scheduled batch. Then fi is the smallest value of p for which \{i G B c : i > f}\ > \\B C 



Notice that whenever crjj increases by 5, then frjj increases by at least ■ 5. This is 
because the increase in crjj is incurred by all the items that increase aij, excluding those 
that are currently frozen. However, \B® ct \ > yy • \B C \ and if matches any item of B[ rz 
before the current time t, then it matches all the items of B^ ct before time t as well. (As 
j^act j g f rac tional and is maximal, B^ ct C I.) 

Notice that 

X I,3 ' m ^ X I,3 — Ejfc> 

1 + In xij + In In k otherwise. 

By Claim 14.51 xij < || always. Therefore, &ij < 2 + In In k always. Because ajj is reset at 
most once, we get that 07 j < 2 • ^ • max{&ij} < y + ^ • In In k. ■ 

The proof of the following property is useful in the rest of the analysis. Consider a batch 
of color c. Let Ji, J 2 , . . . J m be its integral segments (by the order of the matching). Let 
j r be the time that the first item of I r is matched by and let t r be the time slot where 

I r was scheduled by the algorithm. Denote by A r = j r — t r the difference between these 
times. We also denote by l r < \I r \ the number of items from I r that are in the algorithm's 
buffer when the algorithm decides to remove I r (starting at time slot t r ). 

Claim 4.7. For every batch with m integral segments, and for every 1 < p < m, we 

have that A p > Y^r= p +i^r- 

Proof. Notice that for every 1 < r < m, A r > 0. Otherwise the time slot where the algorithm 
starts removing the items in I r is after where they are matched by the batch Thus, 
the algorithm removes all the remaining items of /. This mean that there would be no more 
integral segments after I r . 

We now show that given 1 < r < m — 1, we have that A r+1 < A r — t r . At time t r + \I r \ 
when we reach past the end of I r : s eviction, there are no items of this color in B. Therefore, 
all the items of the following fractional segment (denoted by F r +i), and all the integral items 
that were in B at the time that segment I r+ \ was sheduled (starting at time slot t r +i), enter 
the buffer during the input interval [t r +|J r |+l, t r+ i]. Therefore, t r+ i > t r + |/ r | + |F r+ i|+£ r+ i. 
Combined with the fact that, jV+i — jr + \Ir \ + l-^r+i|> we get that 



^r+l — jr+1 ~ tr+1 < jr + \Ir \ + |-^r+l| — (t r + \Ir \ + l-^r+ll + ^r+l) — A r 
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This complets the proof as 



A p > A p+1 + £ p+ i > A p+2 + £ p+ 2 

p<r<m p<r<m 



Claim 4.8. Every batch such that the first item i £ / is in B at time j contains a 

constant number of segments. 

Proof. Let m be the number of integral segments in We start by showing that Aj < 

KXHn fc ' ^ e assume that the first segment of J is a fractional segment. Otherwise, as the 
first item i £ I is in B at time j, all of / is a single integral segment. Let F\ be the first 
fractional segment. We also assume that |Fi| > 1 Q^ nk as otherwise Ai < — k is triviall, 
because t\ > j and j% = j + \F\\. As there can be at most 1Q 1 Q 1 1 ^ k fractional items at time 
j, at least \Fi\ — 1Q j ) 1 1 ^ k items are added to F 1 after time j. Furthermore, there are at least 
100 fc ln k items from I\ that arrive before time slot t\. These items arrive after the items of F\ 
and therefore after time slot j. Therefore, t\ >j + \F\\ — + 100 fc lnfc > ji - iotn- Thus, 

in this case A 1 < 

We now use Claim 14.71 and get that 

Ek 
l r > (m - 2) — . 
r ~ y ' 100 In k 

Kr<m 

The upper and lower bounds on A x imply that m < 13. ■ 

Claim 4.9. For every batch such that for every item i £ / it holds that i £" B at the 

time it is scheduled by 

2 (& +Vi)- Yl ** = o o°& io & k )- 

iei t=j 

Proof. Let be a batch of color c with m integral segments that satisfies the conditions 
of the claim. For i £ I define Mjj(i) to be the output time slot where i is scheduled by 
For every item i £ J, let (ft + — z Ml ^ be the contribution of % to the pseudo-dual 
cost of (J, j). Note that the sum of all contributions over % £ I is exactly the left hand 
side of the claimed equation. Let F p be the fractional segment between I p ^% and I p . We 
assume w.l.o.g that I starts and ends with an integral segment, as every fractional item has a 
negative contribution, as we prove below. Let M a i g (i) be the time slot in which i is removed 
from B. Notice that for an item i £ I p , M a i g {i) = t p . For every integral segment I p , let f p be 
the time that the first items of I p are "defrosted" (i.e. they are moved from B[ rz to B^ ct ). 
We start with the following observations: 

(i) For every i £ I, fa - z MlJ (i) < -(z MlJ (i) - z Malg (i))- 

(ii) For every p, z tp - z fp > l/£ p . 
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Observation (i) follows as j)i < ZM alg (i)- Observation (ii) follows from Case 5. When the 
integral block I p is scheduled at time slot t p , the pseudo-dual cost 0/',/ of some batch (I',f) 
with c(I') = c reaches 1. Notice that the pseudo-dual cost &i',j> is reset to when I p is 
"defrosted" at time slot f p , and between f p and t p there are never more than £ p items of 
color c in B. Therefore, the rate at which apj' is raised in this interval is at most £ p ■ d/i. 
Therefore, \i increases by at least l/£ p in between f p and t p . 

Let m' be the maximum index for which t rn i < j (the lateset integral segment that was 
sheduled up to time j). Let j' = max{j, max ie / m , M a i g (i)}. (This is the maximum between 
j and the time slot where the algorithm removes the last item of I m '-) Notice that zy = 2j, 
because if f > j, the algorithm removes during the interval \j,f] part of the integral block 
I m ', and therefore the corresponding z-s do not increase beyond their set value when the 
removal began at time slot t rn > < j. We have that 

iei t=j 

Yl (Vi + Vi- ZM I}j (i)) + Yl (Vi + Vi- ZMiAQ) 

iei: M alg (i)<j> iei: M alg (i)>j> 

Y (Vi + Vi- %') - Y (*MiAQ - %o + 

iei: M alg (i)<j> iei: M alg (i)<f 

+ Y &~ Y (zM Itj (i) - ZM alg (i)) 

iei: M a i g (i)>j> iei: M alg (i)>j' 

m' 

< YY^ i+ y i ~ ^~ Y (^M IJ (i)-z j ) + 

p=l ielp iei: M alg (i)<j> 

r 

+ Y Y&~ Y ^M Ij( i)-ZM alg (i)). 

p=m'+l ielp iei: M a i g (i)>j> 

The second equality follows from observation (i). The inequality is explained as follows. From 
observation (i) we have that jji — zmj j(i) < 0. Therefore, if Mjj{i) < j', then y; L — zy < 0, 
so we can ignore these terms for fractional segments. Furthermore, for any i in a fractional 
segment, i/i — 0. 

We upper-bound the above right-hand side as follows. 
First part: Here we bound 

m' 

YY^+y*-^- ( 4 ) 

p=i ielp 

Notice that we bound the contribution of all the items before segment I m '+i assuming that 
they are all matched by the batch to time slot j. (In the next part we use the 

negative contribution that each of these items % accumulates between time slot j and time 
slot Mij(i).) Consider % e I p , p < m! . For every p < r < m' , we have that M a i g {i) < f r 
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and j > t r . Also notice that £ r _i < f r < t r . Therefore, using observation (ii) we have that 
% ~ ZM a i g {i) > T,T=p+i K ~ %■ > Y,7=p+i T r - Moreover, for every such segment I p there are 
£ p items i each with a positive contribution yi — to and at least £ p items with a 

negative contribution jji — Zj < z-M alg (i) — Zj < — Yl^p+i T ^° &■ The ^ as ^ hitegral segment 



I m i cobntributes at most | to d3|. Therefore, 

m' in' — 1 / m' ^ \ 

p=l ieip p=l \ P r=p+l r / 

Let (1 <) pi < p 2 < ■ ■ ■ < p s (< Tnf) be the indices of the segments that have a positive 
contribution to the right-hand side of (J5j). For every u = 1,2, . . . , s, j- > 2- YlT= Pu +i T — ^ ' 
^ — . Because for every p it must be that 100 fc ln k < i p < k, we conclude that s = O (log log k). 
Each segment I Pu contributes to the right-hand side of (jSD at most |. Therefore, 



E E + ^ _ %) = °( lo S lo § k )- 

p=i ie/p 



Second part: In this part we bound 



m 

E E^'" E (*M Itj (i)- z Malg (i))- (6) 

p=m'+lie/p iel: M a i g (i)<j' iel: M alg (i)>j' 

We start by noticing that for every p = ml + 1, m! + 2, . . . , m there are exactly A p items 
that are matched by the batch (I,j) in the interval [t p ,j p — 1] (by the fact that p > m' we 
know that t p > j). These items precede in the input any item in I p , and therefore they are 
scheduled by the algorithm (completely) before time slot f p . In particular, for any such item 
i we have M a i g (i) < f p . Observation (ii) implies that each such item accumulates in the 
interval [f p , t p ] a negative contribution to (JH]) of at least j-. Notice that the same items might 
accumulate a negative contribution from several such intervals for consecutive p-s. Further 
notice that if p > m' + 2, then f p > t p „i > j. Using Claim H~T1 to bound A p , we get that 

m 

E E (^.(0-%)- J2 ( s M Itj (i) - z Malg {i)) 

p=m'+l ie/p iel: M alg {i)<j' iel: M alg (i)>j' 

p=m'+2 V p 7 p=m'+2 \ p r=p+l 

where the extra term 1 accounts for the contributions of I m '+i and I m . We therefore get 
that only integral segments I p such that £ p > 2 ■ YlT= P \i ^ can have a positive contibution. 
As 100 fc lnfc <^p<k, there are at most O(loglogfc) such segments, and each segment adds at 
most \ to right hand side of the above inequallity. ■ 
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We are now ready to prove the main lemma. 



Lemma 4.10. The dual solution (y,z) is a feasible solution of LP y. 

Proof. Consider a dual constraint indexed (I, j). We partion the pseudo-dual cost 07 j + 
YlisiVi °f (Iij) t wo parts. Let i G I be the first item for which i e B at the time 
it is matched by Partition into two sub-batches (h,j), such that l\ 

contains all the items in / smaller than i, I2 contains the rest of Ps items, and j' = M/j(i). 
From Claim S3] the pseudo-dual cost of (h,j) is O (log log A;). Any integral segment of 
{Iz,f) contributes exactly | to ^2 iel2 y%- Only the last integral segment can have a positive 
contribution to <7r 2j ', as any integral block with positive contribution to a batch (/, j) evicts 
all the remaining items of /. Any fractional segment contributes at most 0(loglogA;) to 
<7j 2 j', by Claim [4761 Therefore, as the total number of segments in (i^j') is bounded by 
an absolute constant (Claim H~5j) . the total pseudo-dual cost of is also O (log log A;). 

We therefore conclude that the dual solution (y, z), derived by scaling down (y + y, z) by an 
appropriate factor of O (log log A;), is feasible. ■ 

4.2 Bounding the primal cost 

Here we bound the cost of the primal solution using the cost of the dual solution. 
Lemma 4.11. At the end, 

1 n k'+n 



X>/j = 0(i)- X> + X>- E 

(i,i) \i=i 1=1 j=k'+i 



Proof. We partition the primal cost of the algorithm into three parts, according to the reason 
for incurring the cost. 

Part 1 (regular execution): Consider an increase d\x in \x during regular execution, and 
let t be the current output time slot. By the definition of the algorithm, the dual variables 
that are raised at time /x are {y~i} ieB , {yi} t <i< n i an d {%}*<,<*/+„• Therefore, 



7 / s~^n ». Y~ , ' c '+ n 
11 ^Z^i=l Vi ~ l^j=k' + l 

dfi 



\B\ + n - t + 1 - (A/ + n - t + 1) = \B\ - k' . 



Let x, x be the (partial) primal and pseudo-primal solutions at time \i. For every color c 
for which B^ ct 7^ is fractional, let (I c ,j c ) be the batch of color c that maximizes (i.e., 
at time \i the rate of increase of xij dictates the rate at which color c is removed from the 
buffer). Notice that during regular execusion, if xjj increases then B^ must be fractional. 
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Thus, by definition, 



fractional C 



dfi t , dfi 

1 daicjc ( ^ ^; c j c 



< 



V — ■ UUhJc + V -t, 

^ In A; ^ du /cJc 

E ^+ E i^m,.,. 



c: <3- /c , jc >l 

We bound the first term as follows: 



|£g ct n I c \ < \b\ < k , 

< J In h In h 



In fc In fc 

The last inequality follows from the fact that the volume in the buffer of items in B is at 
least k — Yo^i ( an immediate consequence of Corollary 14. 3p . Because the volume in the 
buffer of items in B is at least k — , J^. k , also \B\ > k — 1f l 2 . fc , . So, 

lOOlnfc I I — lOOlnfc ' 

\ B \-h> = \B\-(k--^-) + (k--^-)-(k 2k 



lOOlnfc' v lOOlnfc' v lnfc' 
12k , k 



lOOlnfc' lnfc 



> 



> 



1^1 ~ (lOOlnfc) + k 

In fc 

151 



In fc 

We now bound the second term ^ c<5 . >i \^c Ct ^Ic\ • x i ct j c . We show that for every color 

c, 

\Bf n J c | ■ x Ic , jc < 0(1) ■ (*u ■ \ B c n /|) • (7) 

The difficulty in proving this is the following. Any increase in xj c j c lower bounds the increase 
in xij, if the batch is relevant to (I c ,j c ) at that time. In this case, it is possible to 

extend the scheduled batch to include all the items in B c D I c - However, the batch 

might terminate before removing all those items because it reaches an item that is in B{ rz 
at the time it needs to be scheduled. The regular reset of Xj et j c takes care of this problem, 
as we show below. 

In order to show Inequality (J7J), we consider three cases, according to the current value 
of fi. The first case is when fi < fi (I c ,j c ) (i.e., before xj c j c experiences a regular reset). 
Notice that every batch that is relevant to (I c ,jc) and is scheduled starting at some 

time fi' < fi, removes more than half of the items in B c (fi), because at the current fi less 
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than half of B c (p) arrived after the first item / = f(I c ,j c ) that causes any interruption in a 
relevant scheduled batch. In this case, 



B^(p)nl c \-x Icdc < \B e (jM)\-x Iei 



•3c 



< \B c {n)\- Yl Xl >i 

relevent 

< 2.^|5 c (//)n/ 



• x i,j- 



(1,3) 



The second case is when p (I c ,j c ) < p < pi(I c ,j c ), where pi(I c ,j c ) is the time at which 
/ is scheduled to be removed completely from the buffer. Notice that xi c j c is reset at 
A*o — fi>o(Ic,jc), so &i c ,jc — Yl(ij) x iJi where the sum is taken over relevant that are 

scheduled after time Pq. Consider such If is never interrupted (something that 

might happen if B{ TZ ^ at the time we reach the end of /), then clearly B^ ct (p) D I c C 
\B c (p) fl I\. Otherwise, let p! = p' T j denote any point in the time interval where (J, j) 
was scheduled (the sets don't change during that interval). Less than half the items in 
B c (p ) arrived before /, so this remains true also for B c (p'). As p < pi(I c ,j c ), we have that 
/ G B c (p). Set p" = n"j j to be the minimum time in [p',p] when Bl rz (p") ^ 0. If no such 
time exists, set p" = p. Clearly, \B™ *(//') | > ^\B c (p")\. Let F = {/,/ + 1, / + 2, n} 
(i.e., the input items starting with /). Now, \B c {p") fl F| > | • \B c (p")\, so 

|a c «V) n F| > Q - 1) ■ \B c (p")\ > \ ■ \B c (p")\. 

Clearly, (J, j) schedules all of B a c ct (p"). Notice that B a c ct (p") RFC F c (^) and |S C (^)| < 
y| • |F c (/i")|. Combining everything together, 



B a c ct (p)nl c \-x Ic , Jc < \B c (p)\-x L 



< 



< Ei iw, 

< E^I^V^nFl-^ 

The last case is when p > pi(I c ,j c ). In this case, consider all the scheduled batches 
that include /. Their total weight is 1, and they've all been scheduled before the current 
p. Because / interrupted a relevant batch, all these batches must be relevant. A weight of 
less than ^ Q f these batches is interrupted before time p, otherwise we would have executed 
Case 6, removing all the remaining items of I c . This contradicts the definition of (I c ,j c ) as 



18 



the batch that currently, at time fi, controls the rate at which color c is evicted from the 
buffer. Thus, a weight of at least yjj of the scheduled batches that include / schedules at 
time /i all the items in 5" ct (/i) nl c . On the other hand, by Claim |4T5| xj c j c < ^j. Therefore, 

\Bf{ii) n i c \ ■ x Ictjc < y • E \ B c ct ^ n J l • ^ y ■ E l 5 ^) n J l • 

(/J) {1,3) 

Therefore, regardless of the value of the current time ^, 

£ |srn/ c i-^ c <3- £ £ |B e ni| •*/,,■< 3- (|B|-*0, 

c:»i c j c >1 c:a IcJc >l 

where the last inequality follows from Claim 14.41 Thus, summing the bounds on the two 
terms, 

J < 4 • (\B\ - k ) < A 



dfi d\x 

which implies trivially that the total primal cost due to regular execution of the algorithm 
is at most 4 times the dual cost. 

Pari 2 (Case 3 and Case 5 execution): Each time an integral block is evicted (Case 5), 
Yh=i V% is raised by \. Preceding each such eviction there is a specific Case 3 execution, when 
the block became integral. These Case 3 and Case 5 executions incur together a primal cost 
of at most 3. (Case 3 evicts a color from the buffer at a cost of at most 1. Case 5 schedules 
an intergal block, and may suspend fractional batches of toal weight 1. So the cost of Case 5 
is at most 2.) Therefore, the total primal increment due to Case 3 and Case 5 is at most 

Part 3 ( Case 6 execution): Case 6 costs at most 2 (just like Case 5). Each time we execute 
Case 6 on color c, we've moved past the end of regular execution fractional scheduled batches 
of color c with total weight at least ^. After the end of this eviction, B does not contain 
any color c items, therefore the next Case 6 execution is due to distinct fractional scheduled 
batches. Therefore the primal increase as a result of Case 6 is at most 20 times the primal 
increase due to regular executions. By the above analysis of regular execution, this incurs a 
cost of at most 80 times the dual cost. ■ 



5 Online Rounding 

In this section we give a randomized online algorithm that rounds the fractional solution x to 
an integral solution for the reordering buffer management problem. The rounding algorithm 
presented here is inspired by our deterministic offline rounding algorithm in [Sj. Here we use 
randomness to replace the knowledge of future input that is needed in [8]. At each step t 
where our rounding algorithm needs to choose a color to evict, it uses only the input up to 
time t and the fractional solution x that we computed up to time t. Thus, our randomized 
online algorithm for reordering buffer management repeats two alternating steps: (i) Extend 
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the fractional solution deterministically up to the current time, (ii) Evict from the buffer 
using randomness some items chosen based on the current partial fractional solution. This 
increments the current time to the next vacant output slot. 

5.1 The rounding algorithm 

The algorithm works in phases. The first phase begins at time k + 1. In the beginning of 
a phase, the algorithm chooses one or more color blocks to evict, based on the fractional 
solution x that was computed up to the output time slot t where the phase begins. Then, 
the algorithm evicts the chosen blocks, and a new phase begins. Notice that in order to 
execute the next phase, we need to extend the fractional solution x to the new time slot 
that we have reached, taking into account the new input items that have entered the buffer 
during the last phase. 

In choosing the colors to evict in a phase, we consider four cases. Let 5 > be a 
sufficiently small constant, and let to be the starting output time slot of the current phase. 
More precisely, the fractional solution computed so far fully uses the time slots up to at least 
t , whereas the integral solution computed so far extends up to time slot t — 1- 

Case 1: The buffer contains an item from which the fractional solution removed so far a 
weight of at least 5. We evict the color block of this item. 

Case 2: The total weight of the items that the fractional solution schedules in the time 
slot t an d are also in our buffer is at least 25. We choose one such item at random with 
probability proportional to the weight it is removed at time to, and we evict its block. 

Case 3: A weight of more than | of the items that the fractional solution schedules at 
time t belong to a single color c that we just evicted from our buffer (i.e., the integral 
solution evicts at time slot t — 1 an item of color c). In this case we first choose color blocks 
to evict according to the following procedure, and then we evict all these blocks in arbitrary 
order. 

We now describe the procedure for choosing color blocks to evict in Case 3. Besides 
choosing color blocks, the procedure also "locks" some volume fractionally scheduled before 
time to- Any volume that is fractionally scheduled starts unlocked. Locked volume is assigned 
to a specific evicted block, and when the weight in the fractional buffer of an item in this 
block drops below 1 — 5, the volume assigned to this block becomes unlocked again. 

We partition the colors into classes according to the number of items in the buffer of each 
color at time to- A color c is in class s = 1, 2, . . . , log k + 1 iff the number of items in the buffer 
of color c is in [2 s-1 , 2 s ). Next, we partition the classes into subclasses as follows. For every 
color c let w c denote the average over the color c items in the buffer of the unlocked volume 
that the fractional solution scheduled for this item before time to- Let W s denote the sum 
of w c over all colors c in class s. To construct a subclass, we collect blocks until their total 
w c weight exceeds 5. In a class, we construct disjoint subclasses using this process while the 
remaining weight is at least 5. Notice that because w c < 5 for every color c, the total weight 
of a subclass is in [5,25). Also notice that in each class we might have colors with total w c 
weight of less than 5 that are not assigned to subclasses. We ignore those colors. If W s < 5 
then no block of class s is chosen. In each subclass, we choose at random one color block to 
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evict. The probability of choosing a color c is proportional to w c . The chosen block locks all 
the unlocked volume in the subclass. Finally, we also choose the largest color block in our 
buffer (This block takes care of the excess weight that we ignored in the above choice.) 

Case 4-' If all else fails (i.e., for all previous cases, the conditions for executing the case do 
not hold), we choose the largest and second largest color blocks, and also apply the Case 3 
procedure that chooses more colors. If after evicting the largest or second largest color block 
one of the other cases applies, we terminate the phase without evicting the remaining chosen 
blocks. (If we don't get to evict the Case 3 procedure choices, we annul the locks generated 
by the choice.) We stress that we choose all the blocks to evict in this case according to the 
situation at time to, but some of the chosen blocks might end up not being evicted. 

5.2 Performance guarantees 

We show that the cost of the integral solution generated by the rounding algorithm is within 
a factor of 0(1) of the cost of the fractional solution generated by the primal-dual algorithm. 
The main idea of the proof is the following. Evicting a color block increases the cost of the 
integral solution by 1, and we would like to change this cost against an increase by some 
(small) constant of the cost of the fractional solution. The blocks evicted due to the procedure 
in Case 3, excluding the eviction of the largest block, are handled separately (see Claim |5T2"|) . 
All the remaining evictions amount to a constant number of blocks evicted per phase. We 
show that for an expected constant fraction of the phases, we can find batches that were 
scheduled by the fractional solution with the following properties: (i) These batches do not 
stretch beyond the time slot reached by the integral solution in the corresponding phase. 
(ii) Their total weight is at least 5. (Hi) They were not selected more than once in previous 
phases (excluding the charging of the Case 3 procedure). This, together with the Case 3 
charging scheme, implies the following guarantee. 

Lemma 5.1. The expected cost of the solution generated by the rounding algorithm is 0(1) ■ 
Ylij x i,j> where x is the primal solution generated by the primal-dual algorithm. 

Proof. We consider the four cases that define a phase that begins at time t . In the first 
two cases our charging scheme is easy to achieve. In Case 1, an item i that is evicted in this 
phase is scheduled in the fractional solution before time to in batches of total weight at least 
5. Because each such batch matches i to an output slot before to, all of these batches end 
before the end of the current eviction of c(i). So we charge this phase to the cost of > 5 of 
those batches. 

In Case 2, consider the fractionally scheduled batches with an item scheduled at time to- 
Let t\ be the earliest time when the subset of these batches that have ended by time t\ has 
total weight of at least 5. Thus, the weight of the subset of these batches that reaches time 
t\ is at least 1 — 5. Consider the subset of batches that schedule at time to an item that is 
in our buffer at that time. This subset has total weight w > 25. Therefore, the total weight 
of batches that schedule at time t an item in our buffer and also reach time ti is at least 
w — 5 > 5. The probability that the algorithm chooses to evict a color block of one of these 
batches is at least 1 — — > |. If this event happens, the current phase ends past t\, and we 
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charge the phase to the weight of at least 5 of batches that use t but end at or before t±. 
If our choice is unsuccessful, we don't charge the phase. This happens with probability at 
most \. 

If we execute neither Case 1 nor Case 2, then for every item in our buffer, the weight of 
this item that the fractional solution scheduled before time t is less than 5. Also, a weight 
of more than 1 — 25 scheduled by the fractional solution at time t is of items no longer in 
our buffer. Notice that these items must have appeared in the input prior to their removal, 
so we've already placed them in the buffer and evicted them in the past. 

By the definition of t±, it's still true that in the fractional solution the total weight of 
batches whose schedule contains the interval [t ,ti] is at least 1 — 5. Let A denote the total 
volume of the content difference between our buffer and the fractional buffer (i.e., of the 
items in our buffer the fractional buffer lacks a total volume of A, and symmetrically of the 
contents of the fractional buffer, a total volume of A belongs to items we no longer hold). 
Let t' > to denote the earliest time where at least a weight of 25 of the fractionally scheduled 
batches that reach t\ schedule an item that arrived at time to or later (i.e., items we haven't 
seen yet). 

Assume for now that Case 3 does not hold. If our buffer at time t contains one or two 
colors that together have more than t' — t items, then the eviction of the two colors chosen 
in the first step of Case 4 makes us reach t'. Notice that we reach t' just by removing the 
items of these colors that are already in our buffer at t — 1. However, as we evict each color, 
additional items of this color that enter the buffer might be appended. If we reach t±, we 
can charge this phase as in Case 2. If we haven't reached ti, then Case 2 now applies for the 
following reason: our buffer at time to contains more than t' — to items that are evicted. We 
advance beyond t' by at least the number of items that arrived after time to that we remove. 
Thus, if we haven't reached ti, there is still a weight of at least 25 of fractionally scheduled 
batches stretching to ti with the current item in our buffer. Therefore, the next phase will 
be charged with probability at least \ (because it executes either Case 1 or Case 2). We do 
not charge this phase. 

So let's assume that there are no such colors. Let 7 > be a sufficiently large constant. 
Suppose that A < (t' — to)/7- By our assumptions, between t and t' there is a total volume 

> (1 — 35) • (t' — t ) that the fractional solution schedules of items that arrived before time 
to- This is because at most 25 (f — to) of the volume t' — to belongs to items arriving past 
t in fractionally scheduled batches that reach t±, and another at most 5(t' — t ) belongs 
to batches that don't reach ti (regardless of when their items arrived). Of this volume, 
more than (1 — 35 — I/7) • (t' — to) must still be in our buffer at time t . Consider the 
fractionally scheduled batches of total weight at least 1 — 5 whose schedule contains the 
interval [to,ti] (which includes t'). At least | — 5 of this weight belongs to batches that 
begin with no more than 4(t' — to)/7 items no longer in our buffer at time to- Otherwise, the 
total volume of items that are no longer in our buffer but are still in the fractional buffer is 

> j ■ 4(t' — t )/7 = (f — t )/7 > A, a contradiction to our assumptions. 

Consider these batches of total weight at least | — 5. In the interval [t + 4(t' — to)/7, t'], 
they contain only items that are either in our buffer at time to or arrive past to- But only less 
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than 25 of this weight belongs to batches that contain, up to time t', any item that arrives 
past to (as their schedule all reach ti). So there's a weight of at least | — 35 of these batches 
that in the interval [t + 4(t' — to)/7,t'] contain only items that are in our buffer at time to- 
Notice that for every color that appears in these batches, our buffer in the beginning of the 
phase contains at least (1 — 4/7) ■ (t' — to) items of this color. Assuming that 7 is sufficiently 
large, (1 — 4/7) ■ (t' — t ) > (£' — t )/2. If there two different colors, then our buffer at time 
to contains one or two colors that together have more than t' — to items, a contradiction to 
our assumptions. Thus, all these batches belong to the same color c. The number of items 
of color c in our buffer is at least (1 — 4/7) • (£' — t ) > 4(t' — t )/j. 

Recall that by our assumptions so far, we execute in the current phase Case 4. If there 
is a color in our buffer with more items than c, then after evicting the largest color one of 
the following two possibilities happens. If we've reached ti then we charge this phase as in 
Case 2. Otherwise, more than half the weight that the fractional solution now removes is 
on items of color c that we currently have in the buffer. Therefore, we will next execute 
either Case 1 or Case 2. We do not charge this phase, and the next phase is charged with 
probability at least |. 

If we choose to evict c (because it has the maximum number of items in the buffer) 
and we don't reach t 1; we end up with no items of color c in the buffer, and a weight of 
> I — 35 > \ is now being removed by the fractional solution from items of color c. In 
particular, this means that Case 3 holds, so in the next phase we definitely will not execute 
Case 4 again. (This scenario is precisely the reason for defining Case 3.) We do not charge 
this phase. If in the next phase we execute Case 1 or Case 2, then the next phase is charged 
with probability at least |. Otherwise, in the next phase we execute Case 3, and as we show 
below, a Case 3 phase is either charged or followed by a Case 1 or Case 2 phase, which is 
charged with probability at least |. 

We now analyze the remaining Case 3. Recall that Case 3 is invoked if the fractional 
solution removes at time slot to a weight of at least | of items of a color that we've just 
evicted. Define t' = t + M, where M is defined as follows. Consider the color c batches that 
pass through t . (Recall that we've just evicted color c.) Each of these batches begins (at 
time slot to) with one or more items that we already evicted from our buffer. Define M to be 
the median number of such items in a batch, where each batch has probability proportional 
to its fractional weight. Notice that at time t! at least a weight of \ of the scheduled batches 
remove an item that arrived at time t or later. If 5 is sufficiently small (so that \ — 5 > 25), 
a weight of at least 25 of those batches reaches t\. Thus, if we've reached t' without removing 
any items that arrived from to onwards, Case 2 applies. Moreover, in the interval [to,t'], at 
least 7 of the scheduled weight is on items that we've already evicted from our buffer before 
time to- This volume is held in the fractional buffer at time to- So A > (t' — to) /A. Thus, if 
either Case 3 holds or the assumptions under which we've analyzed Case 4 do not hold, we 
are left with the following situation. There is a time t' such that A > (t' — to)/7, and if we 
reach t! using only items currently in our buffer, then Case 2 holds. By Claim 15^21 below, the 
Case 3 procedure chooses colors with more than t' — to items that are in our buffer at time 
to- Any item that arrives after time to that we evict pushes us one step further beyond t'. 
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Therefore, after evicting all the Case 3 items, either we reach t\ or we can apply Case 2. All 
but one of the color blocks evicted by the Case 3 procedure are charged via Claim 15.21 If 
we reach t 1; the phase is charged as in Case 2. Otherwise, the phase is not charged, but the 
next phase executes either Case 1 or Case 2 and will be charged with probability at least |. 

Concluding the analysis, in expectation at least | of the phases are charged. The worst 
case is when repeatedly we have a Case 4 phase followed by a Case 3 phase followed by a 
Case 2 phase which is charged with probability |. ■ 

Claim 5.2. For every 5 > there exists 7 = 7(5) such that applying the Case 3 procedure 
starting at time to chooses color blocks totalling more t' — 1 items in our buffer at time t . 
Excluding one block, we can charge the eviction of each block with probability at least \ to 
constant fractional cost incurred before we complete its eviction. The same fractional cost is 
never charged more than once in all Case 3 procedure invocations. 

Proof. We relate the charge for chosen colors to the locking of volume that the fractional 
solution removes prior to time to of items that we hold at time to. Notice that the total such 
volume (locked and unlocked) is precisely A. Notice that in every s-subclass, one execution 
of the Case 3 procedure locks a volume of at most 25- 2 s . While this volume is locked, all the 
items of the evicted color block that locked it are kept in the fractional buffer with weight 
> 1 — 5. Let A^ denote the portion of A that is unlocked. We start by showing that A^ is 
close to A. More specifically, we show that A^ > ' A. To show this, we consider A as 
the volume in the fractional buffer and not in our buffer, and Ap as the unlocked volume in 
our buffer and not in the fractional buffer. 

Notice that each of our evictions of a color block B that is scheduled before to contributes 
to A the total weight in the fractional buffer of 5's items at time to- The sum of these 
contributions is exactly A. If our buffer at time to does not contain any items with volume 
that was locked for the eviction of B, then B contributes the same amount to A and A^. 
Suppose our buffer does contain items with volume locked by the eviction of B. The fractional 
solution holds at time to all such items with weight more than 1 — 5, otherwise we would 
have applied Case 1. Moreover, since this volume is still locked, then all the items of B 
must be held at time to by the fractional solution with weight at least 1 — 5 (otherwise, the 
volume assigned to B would become unlocked). So, if we write the contribution of B to A 
as (1 — 0)| B I, we get that (1 — 0)\B\ > {1 — 8)\B\. The volume that is locked because of 
B contributes at most 4<5|-B| to A — Ap, because \B\ is at least half the maximum size of a 
block in 5's subclass. Therefore, ^ > mm 6 < 6 > . 

Going back to the main argument, let Ci,c 2 ,...,c m be the colors in some s-subclass, 
sorted by non- decreasing order of the time one of their current items first drops to weight at 
most 1 — 5 in the fractional solution. Let q denote the color we choose from this subclass. 
Notice that the contribution to A^ of this subclass is at most 25 ■ 2 s , whereas we evict at 
least 2 S ~ 1 items from our time to buffer. The total A^ volume unaccounted for is less than 
• 2 s < 2<52 Smax , where s max is the maximum participating value of s (this includes 
classes from which we did not take any color block). To handle this portion of Ap that is 
unaccounted for, recall that we always also evict the largest color block, whose size is at 
least 2 Smax_1 . Notice that we might be counting this block twice, once as it might have been 
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chosen in an s max -subclass, and once as the largest block. Summarizing the argument, we 
have that the number of items we evict from our time t buffer is at least 

11. 1 - 55 

for an appropriate choice of 7. (The initial \ factor is for the double-counting of the largest 
block.) 

Finally, we deal with charging the cost of evicting the colors we choose. Consider the 
colors Ci, C2, . . . , c m in an s-subclass as defined above. Let Cj denote the median color in this 
subclass where colors are weighted by their contribution to W s . The probability that we 
choose a color with index j or larger is at least |. If this happens, we charge the fractional 
cost of at least YH=i w c % > f that generated the volume of colors c 1: . . . ,Cj that we are now 
locking. Otherwise, we don't charge the eviction of a block from this subclass. Notice that 
by the time the block with index j or larger releases the lock, the blocks for colors j or 
smaller have been evicted from our buffer (because of Case 1). Therefore, this cost is never 
charged again in a Case 3 procedure. Also notice that in expectation half of the Case 3 
evictions are charged. ■ 
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Proof of Lemma 12.11 Given an an input sequance X, let OPT^ be an optimal solution 
to the reordering buffer problem that uses a buffer of size k. We define an algorithm, ALG&/, 
that uses a buffer of size k! and the optimal solution OPTV In particular, alg^' is offline. (We 
abuse notation and denote by alg^/ and OPT^ also the cost of these respective solutions.) 
Observe that we may assume that after each time OPT^ finishes evicting a color, this color 
will not appear in the input sequence again. (After each eviction, we can rename all the 
following occurences with a new color c' without incurring any additional cost.) We can 
therefore denote by color i the z'th color that OPT^ evicts. Consider a time t > k' during 
the execution of alg^/. Denote by / the minimum color in alga's buffer. For any color c, 
denote by n(c) the number of items of color c in alga's buffer. For any color c > c/, define 
the potential 0(c) of color c by 0(c) = (c — c/ + l)n(c). Finally, we define for each color c a 
counter p(c) initialized to 0. Intuitively, this counter counts the number of items larger than 
c that were evicted so far. Notice that c/, n(c), 0(c), and p(c) are all a functions of t. The 
algorithm ALGfc' works as follows. 
For any time t. 

1. If the eviction of color Cf will evict the last item of this color in X, evict color c/. 

2. Otherwise, let c be the color with the maximum potential in alga's buffer. Evict 
exactly the n(c) items of this color currently in the buffer (without appending any 
aditional arriving items of the same color). 

If after the eviction we cannot execute Step 1, we update p{i) for every Cf < i < c, by 
increasing p(i) by n(c). 

We start by proving a bound on p(i). 

Claim .3. For any color i, at any time during the execution of the algorithm, p(i) < k — k! . 

Proof. Notice that it is sufficient to bound p(cf) in any point in time, as this is the maximum 
counter among the colors that their p(i) can still increase. Assume for contradiction that at 
a given time t the counter p(cf) became larger than k — kl (right after removing a color by 
Step 2). Consider this time t. Let n\ be the number of items alg^ evicted from items of a 
color smaller than Cf (this equals to the number of items with a color smaller than Cf). Let 
n 2 be the number of items ALG&/ evicted from color Cf. At time k + 1 + n\, OPT^ started 
evicting color Cf, therefore at time k + 1 + n\ + n 2 , if the buffer evicted at most n 2 items 
from color c/, evicting this color will reach the last item of c/. On the other hand, because 
p(cf) is at least the number of items from colors larger than c/ that were evicted so far, it 
holds that 

t > k' + 1 + m + n 2 + p(c f ) > k + 1 + m + n 2 . 

This is in contradiction to Step 2, as the counter is increased only if we cannot apply 
Step 1. ■ 

Next we show a lower bound on the potential. 

Claim .4. Consider a time t right before executing Step 2. The maximal potential is 
max c (t){c) > j^jj. 
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Proof. Denote s = 1+ ^ fc/ ■ Assume for contradiction that for any color c we have that 
0(c) < s. Therefore, c — Cf < s, and n(c) < c _ e s +1 , for every color c in the buffer. Because 
there are exactly k! items in the buffer, 

kl W L«J-c/+i 

c=cy c=c / i=l 

Thus, the claim follows. ■ 

We are now ready to prove our lemma. Notice that the number of times executes 
Step 1 is at most OPT fc . Furthermore, notice that in every execution of Step 2 , except for 
OPT fc executions, ^ c p(c) is increased by at least 1+ ^ nfc/ . Because J2 C P( C ) < — k')oPT k , 
there could be at most ( k ~ k '^+ ln fc,) c>PT fc executions. Therefore, 

/ (k - fc')(l + In k')\ 

ALG,, < (2 + ^ ^ <\ • OPT fc , 

and the lemma then follows. ■ 
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